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Abstract. The aim of this paper is to propose a methodology for testing general hypothesis in 
a Markovian setting with random sampling. A discrete Markov chain X is observed at random 
time intervals Tk, assumed to be iid with unknown distribution /x. Two test procedures are 
investigated. The first one is devoted to testing if the transition matrix P of the Markov chain 
X satisfies specific affine constraints, covering a wide range of situations such as symmetry or 
sparsity. The second procedure is a goodness-of-fit test on the distribution p, which reveals 
to be consistent under mild assumptions even though the time gaps are not observed. The 
theoretical results are supported by a Monte Carlo simulation study to show the performance 
and robustness of the proposed methodologies on specific numerical examples. 

Keywords: Asymptotic Tests of Statistical Hypotheses ; Markov Chain ; Random Sampling. 

1. Introduction 

It has been widely recognized that discrete Markov chains are a powerful probabilistic tool 
to study many real phenomena in different application fields. Interests from economics, finance, 
insurance and also medical research are just few examples. From a theoretical point of view, 
alternative technical frameworks have been defined and developed in the literature to perform 
statistical inference in Markovian models: multiple Markov chains Eua, hidden Markov pro¬ 
cesses mmm, random walks on graphs [10] are well known mathematical settings used to 
describe the evolution of real events. 

In this paper, we investigate hypothesis testing issues in a Markov model with random sam¬ 
pling. In the spirit of [3], a discrete homogenous Markov chain (A m ) me p;j is observed at random 
times so that the only available observations consist in a sub-sequence of the initial process. 
Hereafter we denote by (Yk)k£N the observed process. The time gaps Tk (i.e. the number of 
jumps) between two consecutive observations are assumed non-negative, independent and identi¬ 
cally distributed from an unknown distribution [i. A statistical methodology is proposed to test 
specific hypothesis on the transition matrix P or the distribution [i having observed (Y \, ...,Y n ). 

Our interest is to answer the following questions: can we test some specific hypothesis on the 
transition matrix P of the initial chain X when neither the time gaps Tk nor their distribution 
/r are known? Additionally, can we perform a goodness-of-fit test on the distribution fi of the 
time gaps in this framework? 

As explained in [3] , a typical application of this setting occurs when considering a continuous 
time Markov chain Z = o observed at a discrete time grid t\ < ... < t n . In this situation, 

X represents the jump process of Z and (ifcjfceN denotes the number of jumps occurring between 
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two consecutive observations, Yj, := Zt k . If the discrete time grid is chosen independently from 
the chain, the time gaps Tfc are independent random variables unknown to the practitioner. 
Their distribution // is Poisson if Z is observed at regularly spaced time intervals, but one can 
easily imagine a different distribution /i if t\, ...t n are affected by undesired random effects. This 
framework can describe many real phenomena: chemical reactions [T], financial markets [12], 
waiting lines in queuing theory [9] , medical studies [7| . 

In 13], the sparsity of P reveals to be a crucial assumption for the proposed estimation method¬ 
ology. In this paper, we extend the statistical setting defined in [3] by relaxing the sparsity 
assumption and working under a more general framework where the information about P can 
be expressed in the form of affine constraints. It is then possible to incorporate some prior 
information on P such as symmetry, reflexivity or even simply the knowledge of some of its 
entries. The sparsity hypothesis considered in |3] can be retrieved as a particular case. 

In our model, the observations Yi,..., Y n are the realizations of an homogenous Markov chain, 
with a transition probability Q that can be written as an analytic function of P. As in [3], 
neither the transition matrix P nor the distribution // are known and the time gaps t^ ; between 
two consecutive observations Yj._i,Yj. are unavailable. Our aim is now to perform hypothesis 
tests on both the transition kernel P and the time gaps’ distribution p. Theoretical results are 
separated in two consecutive steps. As a first step, we describe how to build an hypothesis test 
on P to study if the transition kernel P satisfies additional affine constraints. This framework 
can be used to test a wide range of hypothesis such as symmetry, sparsity or even to test a par¬ 
ticular value of P or some of its entries. As a second step, we propose a procedure to test specific 
values for the distribution of time gaps fi, considering an hypothesis of the form Hq : /i = /Jo, 
given some preliminary information on P. Both tests are asymptotically exact and rely on the 
asymptotic normality of the empirical transition matrix Q. A Monte Carlo simulation analysis 
is performed in order to highlight the performance of the proposed test. 

The paper is organized as follows: Section [2] provides a detailed description of the statistical 
problem and discusses the identifiability issues encountered in this framework. Section [3] de¬ 
fines an asymptotically exact hypothesis test for the transition kernel P. Section [4] shows the 
theoretical framework to build an hypothesis test on the distribution jjl. Section [5] supports the 
analysis of Sections [3] and [4] with numerical results from a Monte Carlo simulation study. Proofs 
are postponed in the Appendix. 


2. The problem 

We consider an irreducible homogenous Markov chain X = (A m ) me pj with finite state space 
£ = {1,..., N}, N > 3 and transition matrix P. We assume that X is observed at random times 
Ti,...,T n so that the only available observations consist in the sub-sequence Yj, := Xx k ,k = 
1, ...,n. The numbers of jumps t^ := T^-i — Tfc between two observations Yj, are assumed to 
be iid random variables with distribution p on N and independent from X. In this setting, the 
resulting process Y = (Yk)km remains Markovian (see 0) with transition matrix 

(1) Q:=G /i (P) = ^PVW, 

i> o 

where G^ : [—1,1]—>K denotes the generator function of p. We assume that some information 
is available on P in the form of an affine constraint 


Ap = b 
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where p = vec (P) = (Pn, Pni, ■ Pin, • Pnn) T is the vectorization of P, A e M rx7V2 is a 
known full ranked matrix and b 6 M r a known vector. This affine constraint defines the model 

(2) Xi := {m € : Am = b}, 

that is, the set of admissible values for p = vec(P). Each additional affine condition satisfied 
by P reduces by one the dimension of the model. Therefore, the rank r of A indicates the 
information available on P. In the sequel, we denote by d the dimension of the model Af, given 
by d = dim(AI) = N 2 — r. 

Many different forms of information on the initial chain X m can be expressed by affine con¬ 
straints, as described in the following examples. 

a) PI = 1, where 1 = (1,..., 1) T , is an important information that can be expressed as an 
affine constraint on P by 

(1 T <g) I) vec(P) = 1, 

where <S> stands for the Kronecker product. 

b) In [7], the author investigates the progression of a disease using a Markov chain observed 
at unequal time intervals. For this particular application, the transition matrix of the 
chain is expected to be triangular: in this case the information can be represented with 
N (IV — l)/2 linear conditions on P. 

c) If the initial chain X cannot remain at the same state on two consecutive times, the 
transition matrix P is known to have zero diagonal. This information can be expressed 
in the form of N linear constraints on P. 

d) More generally, if one or several state transitions are known to be impossible in the 
initial chain (X m ) mg j<j, the nullity of the corresponding entries can be expressed as a set 
of linear conditions on p. The number of constraints is equal to the number of known 
zeros in P. This situation is treated in [3]. 

e) A symmetric transition matrix P corresponds to N{N — l)/2 linear constraints on p = 
vec(P). 

f) P is doubly stochastic, one can include the condition P T 1 = 1 as N — 1 additional affine 
constraints. 

g) X is a reversible Markov chain, its transition matrix P satisfies the linear conditions 

7 TiPij = 7TjPji for all i,j = 1 where ir = (ni ,..., ttn) is the invariant measure. 

Although 7r is unknown in practice, it can be estimated directly from the observations 
since it is also the invariant measure of Q. 

h) The knowledge of some entries of P being equal, or equal to a known value c G [0,1] can 
be expected in certain practical situations (e.g. the transition probability distribution 
from a given state i is uniform). These are of course particular examples of affine 
constraints on P. 

The positivity of the entries of P appears to be more difficult to handle than affine constraints. 
Nevertheless, the positivity is somewhat less informative as it does not reduce the dimension of 
the model. In this paper, we choose to neglect this information on P for simplicity. 

To avoid critical situations, we assume that (X m ) mg N is an aperiodic Markov chain, in which 
case the transition kernel P has a unique invariant distribution 7r = ( 711 ,..., 7T/v), where i Tj is 
positive for all i = 1,..., N. From the relation Q = G^P), we know that, like P, Q is aperiodic 
recurrent and they share the same invariant distribution. Thus, 7r can be estimated consistently 
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from the observations Yj ; , along with Q by the empirical estimators 


(3) 


VT,; 


n 


= - 1 {Yk = i} , Qij = 

k= 1 




fc +1 


E^i 1 im = <} 


3} 


for all i.j = 1, provided that the state i has been observed at least once. It is known (see 

for instance m ) that Q is asymptotically unbiased and q = vec (Q) is asymptotically Gaussian, 

Vn(q-q) AA(0,S). 

The matrix S € R N ~ xN ~ can be deduced from the asymptotic behavior of the covariances 

( Qij(l~Qij)/ni if (i,j) = (M), 

(4) = 1, lim ncov(Qij,Qke) = < -QijQu/^i if i = k, j ^ £, 

0 otherwise. 

Remark that E is a singular matrix since q — q can only take values in a linear subspace of 

gJV 2 xiV 2 

Since the distribution of the time gaps Tfc is unknown, the main information available on P is 
that it commutes with Q. Thus, given the model M, the possible values for p can be restricted 
to A4nCom(Q), where Com(Q) is the commutant of Q. As in [3j, we consider the commutation 
operator m i— > A {Q)m = vec(MQ — QM ) where 

A(Q) = 1 ® Q-Q r ®I. 

Of course, since Q is unknown, we use the empirical estimator Q to retrieve some information 
on P. 


3. Hypothesis tests on P 

Given that the transition matrix P of the initial Markov process belongs to some 

model A4 of the form ([2]), we want to test additional affine conditions onp = vec (P), via the 
null hypothesis 1~Lq : Aop = bo, where Ao € R kxN ^ is a full ranked matrix, bo € R k and k > 1 
is the number of new conditions to test. To avoid technical issues, the new constraints must be 
compatible with the model, i.e. there exits at least one element m E A4 such that Aom = bo- 
Moreover, we assume without loss of generality that the new constraints are not redundant, in 
the sense that each row of Ao brings a new information on P. Formally, this means that the 
model under 'Ho, 

Mo := {m € 1 ^" : Am = b, Aom = 60} C M, 

is of dimension d—k where we recall d = dim(A4). Let (j) i,..., 4>d-k be a basis of ker(A)nker(Ao), 
extended by cf>d-k+i, ■■■, 4>d to form a basis of ker(A). The initial model M and the constrained 
model Mq can be expressed as 

M = {p 0 + 4>/3 : /3 e R d } and Mo = {po + ^o7 : 7 £ R d ~ k }, 

setting <1? = {(j)\,...,(j)d) G R N2xd , 4>o = {$1, —,<j>d-k) G M Ar2x ( d_fc ) and with po an element of 
Mo- Define the linear sets 

E = Im(A(<5) < I>) = {A (Q)m : m €E ker(A)} 

Eq = Im(A(<5) < I>o) = {A (Q)m : m G ker(A) n ker(Ao)} 

F = E n Eq, 

where Eq stands for the orthogonal complement of Eq in R n2 . Note that since Eq C E, we 
have E = Eq (B F. Moreover, we shall denote by n^;, and the orthogonal projectors in 
R n ~ onto E, Eq and F respectively. We make the following assumptions. 
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Al. The problem is identifiable under Po, i.e. Com(Q) D ker(A) n ker(Ao) = {0}. 

A2. The dimension of E is maximal over the set of stochastic matrices with the same support 
as Q : 

dim(P) = rank(A(Q)$) = max rank(A(Q / )<L). 

Q ■ Q 1 — 1 
supp(Q')Csupp(Q) 

The equivalence, stated in Al, between the identifiability and the matrix A(Q)^o being of full 
rank is mentioned in |3]. Here, the identifiability under Po means that P is fully characterized 
by the condition p E Mo and the fact that P and Q commute. Theorem 3.2 in |3] ensures the 
existence of a consistent estimator of p as soon as the hypothesis Po is true. Remark that unlike 
in [3j, we do not require the initial model M to be identifiable, although if it is the case, the 
two assumptions always hold. 

Though the assumption A2 is quite technical, it is very mild, since the set of matrices Q' for 
which the rank of A(Q 7 )$ is maximal is a dense open set. The argument is similar to the one 
used in the proof of Lemma 2.1 in [3]. 

Hereafter we denote by %g(H r ) the centered generalized chi-squared distribution 

XcAW) = e T We, 

where e is a standard Gaussian vector in M. N ~ and W E M. n2xn2 . Remark that the above 
distribution is simply a Dirac mass at zero if W is the null matrix. 

Theorem 3.1. Suppose that Al and A2 hold. Under Po : A$p = bo, as n —> oo the statistic 

S = n( inf ||A(Q)m|| 2 — inf ||A(<5)m|| 2 

y mSMo m£j\4 

converges in distribution to a X^(W) as n —> oo where 
(5) W := n F A(P)ZA(P) T n F 

The proof is given in Appendix. 


This result does not rule out the possibility that W is the null matrix (the limit distribution 
is 0 in this case), however, it is of no use to perform the test in this situation. 


Theorem |3.1| alone is not sufficient to build a test for Po since the limit distribution is unknown 
in practice. Nevertheless, W can be estimated consistently under Po using estimates of np, P 
and £. 


- An estimator of £ can be easily obtained by using the plug in principle. We replace in 
Q the parameters it and Q by the empirical versions if and Q defined in ([3]) . Since 7r* 
is positive for all i = 1,..., N, the resulting estimator £ is consistent. 

- In [3], the authors show that a consistent estimator of P can be obtained if the problem 
is identifiable. In our framework, the identifiability under Pq assumed in Al reduces to 


{P} = Mo H Corn(Q). 

From Lemma 6.1 in j3j, this condition is actually equivalent to A(Q) < I > o being of full 
rank, in which case 

(6) p= (l-$o(^A(Q) T A(Q)$o) _1 d>«[A(Q) T A(Q))po 

is a consistent estimator of p under Po■ 
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- In the proof of Theorem |3.1| (see Appendix) an estimator of II^ is obtained by considering 
the linear space 

F = Im(A(Q)4>) n Im(A(Q)$ 0 ) ± - 

Under A1 and A2, we prove that the resulting projector 11^-, converges in probability to 
U F as n —> oo. 

By plugging the estimates £, P and II^ in ([5]), we build 

W ■= HpA(P)±A(P) T U p . 

which is a consistent estimate of W under 'Ho- As a result, the limit distribution of the statistic 
S can be approximated by x^{W). This leads to the following result. 

Corollary 3.2. Assume that A1 and A2 hold. Let u\- a denote the quantile of order 1 — a of 
the Xq(IU) distribution for a G (0,1). IfW / 0, we have under Ho : AqP = bo, 

lim P (S > iii-a ) = ol. 

n—>oo 

Moreover, if'ip' G Mo : A (Q)p' / 0, then lim P(5 > u\- a ) = 1. 

n—>• oo 

The proof is given in Appendix. 

Because the quantiles of the generalized y 2 distribution are not available in an analytic form, 
they are approximated by Monte-Carlo methods, following |8j. 

4. Test on p 

The estimation of the transition kernel P is based on the property that P and Q commute, 
which is due to the existence of an analytic function G tl (■) such that Q = G fl (P) defined in 
([!]). However, no particular interest has been given so far to the actual value of G^f ), since the 
knowledge of the distribution p is not needed to build the test procedure on P. 

We emphasize that in our framework it is in general not possible to exactly recover the dis¬ 
tribution n, since different distributions p and u may lead to the same image G V (P) = G fl (P). 
Nevertheless, questions regarding the number of jumps between consecutive observations, or 
their distribution p can naturally arise in practical cases. Suppose for instance that the obser¬ 
vations Yi come from a continuous process (Ab)t>o observed at regular time intervals. Assuming 
that nothing other than the Yf s is known, one might be interested in checking if the underlying 
process (Xt)t >o is a continuous time Markov chain. This hypothesis relies on the distribution 
of the time intervals between two consecutive jumps of Xf. If the process (Xt)t >o is in fact 
Markovian, the times between jumps are drawn from an exponential distribution, resulting in 
Poisson random variables t^. In such a case, testing this hypothesis on p reduces to verify if G^ 
is the generator function of a Poisson variable, that is, if G^(t) = for some A > 0. 

In this section we consider hypotheses of the form Ho : p = po- We assume that the model is 
identifiable, in which case we can build a consistent estimator of p as follows 

P= (i-$($ t A(Q) t A(Q)$) _1 T t A(Q) t A(Q)) po. 

From Theorem 3.2 in [3], we know that p satisfies 

(7) Vn (p-p) = Vn B(q- 


q) +o P ( 1) 
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where B = 4* [$ T A(Q) T A(Q)4>] 1 $ T A(Q) T A(P). 

The procedure to perform a test on the hypothesis Ho : g = go relies on the comparison be¬ 
tween Q and G Y /t (P). Since Q = G Y /t (P), the consistency of P implies that Q — G fJj (P) converges in 
probability to zero. Moreover, we can derive the asymptotic distribution of g^(P) ■= vec (G M (P)), 
leading to the following result. 

Proposition 4.1. If the time gaps distribution g satisfies the moment condition E m (t) < oo, 
then the matrix T = ^ fc>1 ( Yl!j= 1 (P :,_1 ) T ® P k ~^)g{k) is well defined and 

V^(q ~ g^P)) A W(o, (I —TP)E( I -rP) T ). 

The proof is given in Appendix. 

Consequently we get, under Ho, 

n\\^g, 0 (P)\\^ X U(l-TBMl-rBV). 

This result is not sufficient to build the statistical test on the distribution g. So, we proceed sim¬ 
ilarly as before by using consistent estimates of T, B and E to replace the unavailable true values 
in order to approximate the limit variance. Then, the resulting test statistic has critical region 
of the form {n\\q — g fl0 (P)\\ 2 > ui- a } where u\- a is the quantile of the — TP)E(I —fP) T ) 

distribution. 


5. Computational study 


This section is devoted to a Monte Carlo simulation analysis for three hypothesis tests pro¬ 
posed in this paper. In the simulations, the initial Markov chain X behaves like a reflected 
random walk. The chain takes values in some finite space having N = 10 states. Our ultimate 
aim is to test the assumption P = Pq, where Pq is given by 


( 8 ) 


^0 = 


010 ... 

0.5 0 0.5 : 

0 0 

: 0.5 0 0.5 

...010 


Assuming that nothing is known on the transition kernel P, we divide the hypothesis tests 
on P into three steps. First, we want to test if the support of P is contained in that of Po, 
i.e. if only the upper and lower diagonals of P have non-zero entries. Secondly, we investigate 
the test procedure for the null hypothesis P = Po, under the assumption that the support of P 
is restricted to that of Pq. Finally, a third test is carried out for hypothesis on the distribution g. 


We perform the three tests for four sample sizes: n = 200, n = 500, n = 1000 and n = 2000. 
Under the null hypothesis, we simulate the Markov chain X = (A m ) mg pj with transition matrix 
Po and the times gaps r^ are drawn from a Poisson distribution g = V(l). Thus, the available 
observations are Y j. := Xx k where Ty. = X^j'=i T fc for k = l,...n. We recall that neither the 
initial Markov chain X nor the time gaps are observed and the distribution g of the r^. is 
unknown to the practitioner. The nominal significance level is fixed equal to a = 5% for all 
numerical experiments. 
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Test 1. We wish to test if the support of P is contained in that of Po, i.e. if only the upper 
and lower diagonals of P have non-zero entries. The hypothesis is 

: supp(P) C supp(Po), 

and the model is maximal, i.e. it contains all stochastic matrices: = {vec(M) : 

Ml = 1}. By Monte-Carlo method, the significance level is estimated for the four sample sizes 
using 10 4 replications. Results are gathered in Table [I] 


n 

200 

500 

1000 

2000 

a 

0.131 

(0.0034) 

0.064 

(0.0024) 

0.059 

(0.0024) 

0.056 

(0.0023) 


TABLE 1. Test for hypothesis Pf, 1 ' : supp(P) C supp(Po) in the model J\A^ = (vec(M) : 

Ml = 1}. The table contains summary statistics of Monte Carlo simulation results obtained 
with P = P 0 given in Eq. |8|, for the four different sample sizes n and for p = 'P(l). The 
probability to reject TL ^ is estimated for a nominal size a = 0.05, based on 10 4 replications. 

Standard errors are given in brackets. 

Table [I] shows that the estimated significance level converges to the nominal size 5% as n in¬ 
creases. It seems that the test is accurate as soon as the sample sizes is greater than 500. 

To evaluate the power of the test, we generate a Markov chain X with transition matrix 
Pt = tP 0 + (1 — t)C where C is a stochastic matrix with full support drawn randomly before¬ 
hand (the same value of C is used during the whole simulation study). We estimate the power 
for different values of t varying in (0,1) with 0.1 increments. The empirical reject frequencies 
under the alternative hypothesis are given in Figure [TJ 

For small sample sizes, e.g. n = 200, the estimated type I error probability (~ 0.13 corre¬ 
sponding to t = 1) is far from the nominal value a = 0.05. The test is not powerful in this 
case as we can see that the power remains smaller than 0.4 for t E (0,1) and increases in a 
neighborhood of 1, which indicates a bias. Nevertheless, the theoretical results are verified for 
sample sizes as large as n = 500 and the test is no longer biased. Both the estimated powers 
and significance level appear to converge to the expected values as n grows. 

Test 2. We investigate the test procedure for the hypothesis 

nf : P = Po, 

assuming that the support of P is restricted to that of Po- Note that in this case, the first and 
last row of P are known since they contain only one non-zero element. The model is given by 

M^ = { vec(M) : supp(M) C supp(Po)}. 

The estimated significance levels are given in Table 2. 

The convergence seems slower in this case compared to the previous test procedure, with 
an estimated significance level not quite in a two standard-deviation range of objective value 
for n = 2000. The slow convergence is partly explained by the approximation of the limit 
distribution by the generalized chi-squared ’X^{W) obtained with the estimated matrix IV. 
Nevertheless, the convergence in distribution of the test statistic to the Xg( W) is clearly observed 
in Figure [2j 
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0.2 

0.131 ■ 




FIGURE 1. Test ofP^ : supp(P) C supp(Po) in the model At* 4 ' = {vec (M) : All = 1}. 
The graphs represent the estimated power of the test under the alternative P = tPo + (1 — t)C, 
for the four sample sizes n = 200 (top-left), n = 500 (top-right), n = 1000 (bottom-left) and 
n = 2000 (bottom-right). The power is estimated for a nominal size a = 0.05, based on 10 4 
replications. 


n 

200 

500 

1000 

2000 

a 

0.255 

(0.0044) 

0.104 

(0.0031) 

0.075 

(0.0026) 

0.057 

(0.0023) 


TABLE 2. Test for the hypothesis 'Hq 2 ' 1 : P = Po in the model AA^ 2 ' 1 = {vec(A I) : supp(Al) C 
supp(Po)}- The table contains summary statistics of Monte Carlo simulation results obtained 
with P = Po given in Eq. |8|, for the four considered sample sizes and for /x ~ T-’(l). The 
probability of rejecting Tl q 1 ' is estimated for a significance level a = 0.05, based on 10 4 replica¬ 
tions. 


Test 3. We now focus on the distribution fi of the random sampling (rfc)fc We want to test the 
hypothesis 

n ( 0 3) :p = p 0 = V(l), 

in the model A4^- 2 \ This means that we perform the test to see if p is a Poisson distribution 
with parameter 1, knowing that the support of P is contained in the upper and lower diagonals. 
This model satisfies the identifiability assumption Al. 

We estimate the probability of rejecting the null hypothesis when the time gaps Ty are dis¬ 
tributed from a Poisson distribution V(X), with the parameter A varying in (0.5,1.5) with 0.1 
increments. Table [3] provides the empirical rejection frequency under the null hypothesis A = 1. 
Figure [3] gives the estimated power of the test as function of the parameter A. 
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FIGURE 2. Histograms of the 10 4 realizations of the test statistic for the test ofTL q 2> : P = Pa 
in the model M' 2 ' = (vec(M) : supp(A/) C supp(Po)} ■ The histograms are obtained for n = 500 
(left), n = 1000 (center) and n = 2000 (right) with the density of the limit distribution Xq(W) 
added in straight line. 


n 

200 

500 

1000 

2000 

a 

0.269 

(0.0044) 

0.081 

(0.0027) 

0.056 

(0.0023) 

0.054 

(0.0023) 


TABLE 3. Test for the hypothesis H q 3 ^ : p = V{1) in the model A4 (2 ' = {vec(M) : supp(M) C 
supp(Po)}- The table contains summary statistics of Monte Carlo simulation results obtained 
with P = P 0 for the four sample sizes under the null hypothesis. The nominal size of the test 
is a = 0.05, based on 10 4 replications. 


The power of the test is not satisfactory for n = 200 but it increases significantly with n. 
Although the time gaps are not observed, this simulation confirms that performing a goodness- 
of-fit test on their distribution is still possible in this framework. In particular, the intensity 
of the process assuming a Poisson distribution can be recovered from the indirect observations. 
The hypothesis test turns out to be rather satisfactory provided the number of observations is 
sufficient. 


6. Conclusion 

We develop an hypothesis testing methodology on the transition matrix P and the distribution 
p of time gaps in a Markovian model with random sampling. The original contribution consists 
in the construction of a testing procedure for affine hypothesis on P in this framework as well as 
a test on the distribution p of the observation times. This setting requires additional information 
on P to make the model identifiable. The information is expressed in the form of affine con¬ 
straints on P , thus extending the framework considered in |3j in which P is assumed to be sparse. 

We show that the different forms of information on P, formalized via affine constraints, can 
be sufficient to recover the true distribution p of the time gaps, or at least to build a consistent 
testing procedure on specific values of p. The simulation study confirms that a goodness-of-fit 
test on p can be carried out successfully despite the fact that the time gaps Ty are not observed. 


7. Appendix 


Proof of Theorem 3.1 


We use the fact that, for A an affine subset of an Euclidean space, 
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FIGURE 3. Test for the hypothesis : p = V(l) in the model M <2 ' = (vec(M) : supp(JU) C 
supp(Po)}- The graphs represent the estimated power under the alternative p = V(X), for A in 
(0.5,1.5) and for n = 200 (top-left), n = 500 (top-right), n = 1000 (bottom-left) and n = 2000 
(bottom-right). The power is estimated for a nominal size a = 0.05, based on 10 4 replications. 


for all ao G A, with n_ 4 lin the orthogonal projector onto „4i; n = A — A. Define the linear sets 
E = Im(A((3)<J>) and Eq = Im(A((3)<3?o), we get 

inf ||A(Q)m|| 2 - inf ||A(Q)m|| 2 = ||(I -n^)A(Q)m 0 || 2 - ||(I -n^)A(Q)m 0 || 2 

meMo m&M 0 

for any mo in A4o and in particular, for mo = p under 'Ho- Since Eo C E, the Pythagorean 
theorem yields, under HLo, 

inf ||A(Q)m|| 2 - inf ||A(Q)m|| 2 = ||(n^ - U^)A(Q)p\\ 2 = ||II # A(Q)p|| 2 , 

m£M o meM 0 

setting F = E n Eq . Write IIp,A (Q)p = n F A(Q)p + (lip, — IIp)A((3)p, we have 

sfn U F A(Q)p = -x/h n F A(P)g —^A AA(0, U F A(P)EA(P) T U F ), 

yielding n||IIpA((5)p|| 2 —^A- x 2 (IF). So, to prove the result, it remains to show that 

n(||n F A(Q)p|| 2 - ||n^A(Q)p|| 2 ) = op(1). 

Since y/n\\A(Q)p\\ = Op{ 1), it suffices to show that lip, converges in probability to II F in view 
of 

n(||n F A(Q)p|| 2 -||n^A(Q)p|| 2 ) < n||(n F - n^)A(Q)p|| (||n F A(Q)p|| + ||n^A(Q)p||) 

< |||n F -np||| x2n||A(Q)p|| 2 , 
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where |||.||| denotes the operatorn norm. Recall that 11^, = II^ — IIwith E = Im(A((5)<l>) and 
Eq = Im(A(Q)^o)- By Al, A(Q) < I > o is of full rank and 

= A(Q)$o (^A(Q) t A(Q)$o) _1 d>Q A(Q) t 

clearly converges in probability to Il£ 0 . Moreover, we know by A2 that dirn(T) = Im(A((3)<I>) < 
dim(£'), so that the convergence of A(Q)3> to A(Q)<b is sufficient for 11^ to converge to n^. 
Thus, Up converges in probability to II^? as n —> oo which ends the proof. 


Proof of Corollary 3.2. By continuity of the map (e, W) e->• e We, Slutsky’s lemma ensures 
the convergence in distribution of x^{W) to Xg( W) an d therefore, the convergence of the 
estimated quantile u\~ a to the true quantile u\- a of the Xg(B 0 distribution. It follows that 
linin-s.oo P (5 > ui-a) = P(<S > ui- a ) = ol. Now, if Mp' E Mo : A (Q)p' / 0, we show easily that 
inf me _A/( 0 ||A(<5)m|| 2 does not converge to zero as n grows to infinity, while we still have 

inf ||A(<5)m|| 2 < ||A(Q)p|| 2 —> 0. 

m£M 


Hence, S = n( inf mej vi 0 || A(Q)m,|| 2 — inf mG _,vi ||A((5)m|| 2 ) diverges in probability in this case, 
and thus the result follows. 


Proof of Proposition 4-1. To show that the matrix T exists, it suffices to show that the series is 
normally convergent. We use that |||(-P‘ ?-1 ) T <8> -P fc- - 7 '||| < |||P J_1 1|| |||P fe_J ||| < 1 to get 

E (E iik p ’ _1 ) t ® pl "ii W) < E«‘) = W' 

k> 1 ' j= l ' k> 1 

which is finite by assumption. We now compute the differential of g^ at P. For H E W NxN such 
that \\\H\\\ < 1, we have 

linr l[ G ,{P + tH)-G^P)) = lim- + ^[(P + t H ) k -P k }^k) 


t—>0 t 


k> 1 


= lim - 
t—t 


L k> 1 ^ j=1 


= E W' 

k>l x j=l ' 

Applying the vectorization yields, in view of vec {ABC) = (C* T <8> A) vec (B), 
linr - f (g,(P + tH) - g,(P)) = ^ f E vec {P k ~ j HP^ 1 )) fi(k) 


k>1 x j=1 


L k >1 ' j=l 
= T vec(H). 


vec(H) 
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It follows from Cramer’s theorem that, 

Vn{9i*(P) ~ 0) =VnT(p-p ) + o P (l). 

Using ([7]), we deduce that — q) = \fn TB(q — q) + op(l), which leads to 

Vn {q ~ 9fi{P)) = Vn (I -TB) (q - q) + o P (l), 

and the result follows. 
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